Unsupervised Learning

نویسندگان

  • Joachim M. Buhmann
  • Wolfgang Maass
  • Helge Ritter
  • Naftali Tishby
چکیده

What is unsupervised learning and how does it relate to the well founded theory of supervised learning? These questions have been discussed during this seminar which brought together neural modellers, statisticians, computational learning theorists (\COLT people") and theoretical computer scientists and physicists. The eld of machine learning with its broad range of pattern recognition applications in data mining and knowledge discovery, in information retrieval and in classical areas like speech and image processing, computational linguistics or robotics is confronted with various problems beyond classiication and regression. The search for structure in large data sets requires automatic inference tools which can also provide quality guarantees to validate the results. The discussions after the talks and in special discussion sessions circled around two main issues of unsupervised learning: 1. What does it mean to detect structure in a data set and how can we quantify it? 2. How can we provide guarantees that the detected structure generalizes from one sample set to a second one? It is unrealistic to expect a general answer to the rst question. A general theory of structure has not been developed yet and attempt like the inference on the basis of Kolmogorov complexity are debated. One might even argue that such a goal is completely elusive since it encompass the program of natural science, engineering and the humanities. The diierent talks, therefore, covered a wide spectrum of special suggestions how structure could be deened and detected ranging from trees in image analysis, informative projections like PCA or ICA representations of high dimensional data, clusters in vectorial data and histograms as well as groups in relational data or principal surfaces. It became apparent in the discussion of simulation results that uctuations in the data should have little innuence on the learned structures. This requirement might be enforced by bounding techniques as they have been developed for the computational learning theory of supervised learning or by information theoretic compression ideas. The challenges of unsupervised learning for the COLT and 1 the modeling community seem to crystallize around the questions how optimally generalizing structures in data can be discovered and how they are characterized and validated in terms of robustness, compactness (description length) and eeciency for learning. What does biology teach us about unsupervised learning? Apart from the miracle how supervised learning might be organized in the brain at the neuronal level, the biological substrate seems to support unsupervised learning …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High-Dimensional Unsupervised Active Learning Method

In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the da...

متن کامل

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...

متن کامل

Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning

Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...

متن کامل

INTEGRATED ADAPTIVE FUZZY CLUSTERING (IAFC) NEURAL NETWORKS USING FUZZY LEARNING RULES

The proposed IAFC neural networks have both stability and plasticity because theyuse a control structure similar to that of the ART-1(Adaptive Resonance Theory) neural network.The unsupervised IAFC neural network is the unsupervised neural network which uses the fuzzyleaky learning rule. This fuzzy leaky learning rule controls the updating amounts by fuzzymembership values. The supervised IAFC ...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999